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1  Introduction 


In  this  report,  we  study  the  multiparty  communication  complexity  problem  of  the  multiparty  equality 
function  (MEQ): 


EQ(x i,  ■■>,xn) 


0  if  x’i  =  •  •  •  =  xn 
1  otherwise. 


(1) 


The  input  vector  x  =  (aq,-  •  •  ,xn)  is  distributed  among  n>  2  nodes,  with  Xi  known  to  node  i,  where 
Xj  is  chosen  from  the  set  {1,  •  •  • ,  M},  for  some  integer  M  >  0. 


1.1  Communication  Complexity 

The  notion  of  communication  complexity  (CC)  was  introduced  by  Yao  in  1979  [2],  who  investigated 
the  following  problem  involving  two  separated  parties  (Alice  and  Bob)  want  to  mutually  compute  a 
Boolean  function  that  is  defined  on  pairs  of  inputs.  Formally,  let  /  :  X  x  Y  H >  {0, 1}  be  a  Boolean 
function.  The  communication  problem  for  /  is  the  following  two-party  game: 

Alice  receives  x  €  X  and  Bob  receives  y  €  Y,  and  the  goal  is  for  them  to  compute  f(x,y), 
collaboratively.  Alice  and  Bob  have  unlimited  computational  power  and  a  full  description  of  /,  but 
they  do  not  know  each  other’s  input.  They  determine  the  output  value  by  exchanging  messages.  The 
computation  ends  when  either  Alice  or  Bob  has  enough  information  to  determine  f(x,  y),  and  sends  a 
special  symbol  “halt”  to  the  other  party. 

A  protocol  P  for  computing  /  is  an  algorithm,  according  to  which  Alice  and  Bob  send  binary 
messages  to  each  other.  A  protocol  proceeds  in  rounds.  In  every  round,  the  protocol  specifies  whose 
turn  it  is  to  send  a  message.  Each  party  in  his/her  turn  sends  one  bit  that  may  depend  on  his/her 
input  and  the  previous  messages  he/she  has  received.  A  correct  protocol  for  /  should  terminate  for 
every  input  pair  (x,y)  €  X  x  Y,  when  either  Alice  or  Bob  knows  f(x,y). 

The  communication  complexity  of  a  protocol  P  is  the  number  of  bits  exchanged  for  the  worst  case 
input  pair.  The  communication  complexity  of  a  Boolean  function  /  :  X  x  Y  i— >  {0, 1},  is  that  of  the 
protocols  for  /  with  the  least  complexity. 


1.2  Multiparty  Communication  Complexity 

There  is  more  than  one  way  to  generalize  communication  complexity  to  a  multiparty  setting.  The  most 
commonly  used  model  is  the  “number  on  the  forehead”  model  introduced  in  [1],  Formally,  there  is 
some  function  /  :  II  ”= , X%  (->•  {0,1},  and  the  input  is  (x±,X2,  ■  ■  ■  ,xn)  where  each  Xi  €  A/.  The  i-th 
party  can  see  all  the  Xj  such  that  j  /  i.  As  in  the  2-party  case,  the  n  players  have  an  agreed-upon 
protocol  for  communication,  and  all  this  communication  is  posted  on  a  “public  blackboard”.  At  the 
end  of  the  protocol  all  parties  must  know  /(aq,  ■  ■  ■  ,xn).  In  this  model,  the  communications  may  be 
considered  as  being  broadcast  using  the  public  blackboard.  Tight  lower  bounds  (at  least  up  to  constant 
factors)  often  follow  from  considering  two-way  partitions  of  the  set  of  parties. 


2  Models 

2.1  Communication  Model 

Instead  of  the  “number  on  the  forehand”  model,  we  consider  a  point-to-point  communication  model 
(similar  to  the  message  passing  model),  which  we  believe  is  more  realistic  in  networking  settings.  We 
assume  a  synchronous  fully  connected  network  of  n  nodes,  the  node  IDs  (identifiers)  are  common 
knowledge.  We  assume  that  all  point-to-point  communication  channels/links  are  private  such  that 
when  a  node  transmits,  only  the  designated  recipient  can  receive  the  message.  The  identity  of  the 
sender  is  known  to  the  recipient. 
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2.2  Protocol 


A  protocol  P  is  a  sequence/schedule  of  transmissions  and  computations  {010:2  •  •  •  «l(P)}-  Here  cq  = 
(Ti,  Ri,  fi(xTnT[+  (l)))  specifies  that  in  the  l- th  step,  node  7}  sends  a  channel  symbol  as  a  function 
fl{xTv  T+(l))  to  node  Ri,  with  TJ+(Z)  denoting  all  the  channel  symbols  party  Ti  has  received  up  to  step 
l  —  1.  In  the  rest  of  the  paper,  we  will  use  T,+  as  T,+  (Z)  when  it  is  clear  from  the  context.  L(P)  is  the 
length  of  protocol  P.  The  communication  complexity  of  protocol  P  is  defined  as 

HP) 

C(P)  =  £log2S,(P),  (2) 

i=i 

where  Si(P)  is  the  size  of  the  range  of  fi(xTrT+(l)),  i.e.,  the  number  of  possible  channel  symbols 
needed  in  step  l  of  protocol  P,  considering  all  possible  inputs.  If  only  binary  symbols  are  allowed,  the 
communication  complexity  simply  becomes  L(P). 

3  Problem  Definitions 

In  this  paper,  we  study  the  communication  complexity  of  computing  the  MEQ  function  in  a  distributed 
manner.  We  consider  two  versions  of  the  problem,  which  differ  in  where  the  MEQ  function  is  being 
computed. 

3.1  MEQ- AD:  Anyone  Detects 

We  first  consider  the  protocols  that  terminates  whenever  one  of  the  nodes  detects  a  mismatch.  Precisely 
speaking,  a  protocol  P  is  said  to  solve  the  MEQ-AD  problem  deterministically  if  by  the  end  of  the 
L(P)~ th  step,  every  node  computes  its  own  decision  EQi  €  {0, 1}  such  that 

EQi  =  ■  ■  ■  EQn  =  0  EQ(x\ ,  •  •  • ,  xn)  =  0.  (3) 

We  will  say  node  i  detects  a  mismatch  if  it  sets  EQi  =  1.  In  other  words,  if  the  inputs  to  the  n  parties 
are  not  identical,  there  must  be  at  least  one  node  that  detects  a  mismatch. 

3.2  MEQ-CD:  Centralized  Detect 

The  second  class  of  protocols  we  consider  are  the  ones  in  which  a  particular  node  is  assigned  to  perform 
mismatch  detection.  Without  loss  of  generality,  we  can  assume  that  node  n  has  to  perform  detection. 
Then  a  protocol  P  is  said  to  solve  the  MEQ-AD  problem  if  by  the  end  of  the  L(P)~ th  step,  node  n 
computes  decision  EQn,  which  is 

EQn  =  EQ(x  1,  •  •  • ,  xn).  (4) 


3.3  Goal 

Denote  T ad(p,  M)  and  T cn(n,  M)  as  the  set  of  all  protocols  that  solve  the  MEQ-AD  and  MEQ-CD 
problem  with  n  nodes,  each  of  which  is  given  an  input  value  xt  €  { 1 ,  -  •  • ,  M},  respectively.  We  are 
interested  in  finding  the  communication  complexity  of  the  both  problems,  which  are  defined  as 

Cad(ti,M)=  inf  C(P), 

P&TAD{nM) 

CCD(n,  M)  =  inf  C(P). 

PerCD(n,M) 
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(5) 

(6) 


It  is  worth  pointing  out  that  CadU,  M)  <  Ccd(it-,  M),  since  any  protocol  that  solves  the  MEQ-CD 
problem  solves  the  MEQ-AD  problem  with  the  same  n  and  M  as  well.  In  the  rest  of  this  report,  we 
will  mainly  focus  on  the  MEQ-AD  problem,  except  for  Section  10,  in  which  we  discuss  the  MEQ-CD 
problem. 


4  Upper  Bound  of  the  Complexity 


An  upper  bound  of  the  communication  complexity  of  both  versions  of  the  MEQ  problem  is  (n  — 
l)log2M,  for  all  positive  integer  n  >  2  and  M  >  1.  This  can  be  proved  by  a  trivial  construction:  in 
step  i,  node  i  sends  Xi  to  node  n,  for  all  i  <  n.  The  decisions  are  computed  according  to 


EQi 


EQ(x  i,  •••,£„)  ,i  =  n; 
0  ,i  <  n. 


(7) 


It  is  obvious  that  this  protocol  solves  both  the  MEQ-AD  and  MEQ-CD  problems  with  communication 
complexity  (n—  1)  log2  M,  which  implies  CAD(CD){ni  M)  <  (n—  l)log2M.  In  particular,  when  M  =  2k , 
we  have  CAD(CD)(n^k)  <  (n  —  1  )k. 


5  Loose  Lower  Bound  of  Complexity  using  Traditional  Techniques 

In  most  of  the  existing  literature  on  multiparty  communication  complexity,  the  “number  on  the  fore¬ 
hand”  model  or  a  broadcast  communication  model  is  usually  assumed.  Under  these  models,  when  a 
node  transmits,  all  other  nodes  receive  the  same  message.  This  broadcasting  property  makes  it  pos¬ 
sible  to  consider  two-way  partitions  of  the  set  of  nodes  since  the  nodes  in  each  partition  shares  the 
same  information  being  broadcast  and  can  be  viewed  as  one  virtual  node.  Thus  results  from  two-party 
communication  complexity  can  be  extended  to  the  multiparty  case,  and  tight  bounds  (rather  than  just 
capturing  the  order)  can  then  be  obtained. 

However,  the  above  technique  no  longer  works  well  in  obtaining  tight  bounds  under  our  point-to- 
point  communication  model.  For  example,  the  complexity  of  the  two-party  EQ  problem  of  fc-bit  inputs 
can  be  proved  to  be  k  with  the  “fooling  set”  argument:  Suppose  in  contradiction  that  there  exists  a 
protocol  of  complexity  at  most  C(P)  <  k  that  solves  the  two-party  EQ  problem.  Then  there  are  at 
most  2clyP'>  <  2k  —  1  communication  patterns  possible  between  the  two  nodes.  Consider  all  sets  of  2k 
pairs  of  input  values  (x,  x).  Using  the  pigeonhole  principle  we  conclude  there  exist  two  pairs  (x,x)  and 
(x' ,  x ')  on  which  the  communication  patterns  are  the  same.  It  is  easy  to  see  that  the  communication 
pattern  of  (x,x')  is  also  the  same  as  (x,x).  Hence,  the  nodes’  final  decisions  on  (x,x)  must  agree  with 
their  decisions  on  (x,x').  But  then  the  protocol  must  be  incorrect,  since  EQ(x,x')  =  1  /  EQ(x,x). 

The  “fooling  set”  argument  above  can  be  extended  to  the  case  with  n  >  2  nodes  and  arbitrary 
M  >  1:  partition  the  n  nodes  into  two  sets  (say  L  and  R),  there  must  be  at  least  M  patterns  of 
communication  between  the  two  sets  L  and  R.  By  applying  this  argument  to  all  possible  two-partitions 
such  that  \L\  =  1  and  \R\  =  n  —  1,  we  can  obtain  a  lower  bound  on  the  communication  complexity  as 

Tl 

Cad(CD)(jIiM)  >  2  log2  M-  (8) 

This  lower  bound  is  within  a  factor  of  1/2  of  the  upper  bound  we  obtain  previously,  which  implies  that 
E ad{CD)  Af)  =  @(ralog2M).  However,  we  can  show  that  the  lower  bound  of  ^  log2  M  is  generally 
not  achievable.  An  example  for  this  is  the  MEQ(3,4)  problem.  It  can  be  shown  that  Cad{ 3,4)  = 
Ccd( 3, 4)  =  4,  while  |  log2  M  =  3.  Details  can  be  found  in  Appendix  A. 

The  example  above  has  demonstrated  that,  under  our  point-to-point  communication  model,  we 
can  no  longer  extend  results  from  two-party  communication  complexity  to  multiparty  version  for  tight 
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bounds  in  the  way  it  has  been  done  under  the  broadcast  communication  models.  The  main  reason  for 
this  is  the  lack  of  modeling  of  the  “networking”  aspect  of  the  problem  in  both  the  two-party  model 
and  the  broadcast  communication  models.  In  the  two-party  model,  since  there  are  only  two  nodes,  no 
networking  is  necessary.  In  the  broadcast  communication  models,  all  the  nodes  share  a  lot  information 
from  the  broadcast  and  have  roughly  the  same  view  of  system,  which  makes  it  a  not-so-distributed 
network.  On  the  other  hand,  under  our  point-to-point  communication  modes,  each  node  may  only 
receive  information  from  a  subset  of  nodes;  it  is  even  possible  that  two  nodes  may  receive  information 
from  two  disjoint  sets  of  nodes.  As  a  result,  different  nodes  can  have  very  different  views  of  the  system. 
This  makes  the  problem  of  finding  the  tight  bound  of  communication  complexity  difficult,  and  new 
techniques  may  be  required. 

6  Equivalent  MEQ-AD  Protocols 

In  this  section,  we  considers  protocols  €  T AD{n,M).  A  protocol  P  is  interpreted  as  a  directed  multi¬ 
graph  G(V,  E),  where  the  set  of  vertices  V  =  {1,  •  •  • ,  n}  represents  the  n  nodes,  and  the  set  of  directed 
edges  E  =  {(Ti,  R\ ),•••,  (T^p),  -Rl(p))}  represents  the  transmission  schedule  in  each  step.  From  now 
on,  we  will  use  the  terms  protocol  and  graph  interchangeably,  as  well  as  the  terms  transmission  and 
link. 

We  will  say  that  a  protocol  P  is  not  better  than  protocol  P'  if  C(P )  >  C(P').  Two  protocols  P 
and  P'  are  said  to  be  equivalent  if  they  are  not  better  than  each  other.  The  following  lemma  says 
that  we  can  flip  the  direction  of  any  link  in  a  protocol  P  and  obtain  a  protocol  that  is  equivalent  to  P. 

Lemma  1  A  protocol  P  =  {aq  •  •  •  cq_icqcq+i  •  •  •}  with  oy  =  (7),  Rp  fi(xTt,  T;+))  is  equivalent  to  P'  = 
{aq  •  •  •  a/__ia(a(+1  •  •  •}  if  the  following  conditions  are  all  satisfied: 

•  ot[  =  (RhThf[{xRvR'l)).  Here  fl(xRl,Rf)  =  fi(xTl,Tf) \Xl=...=Xn=XRi  is  the  symbol  that  party 
Ri  expects  to  receive  in  step  l  of  protocol  P,  assuming  all  parties  have  the  same  input  as  xR( . 

•  «m  =  (Tm,  Rm,  frn{xTm u  Tm))  for  m>l. 

-  If  Tm  =  Rh  f'm(xTrn,T+ )  =  fm{xTrn,T+) \fl(XTi,T+)=f{(XRi,R+)  is  the  symbol  that  party  Rt 
sends  in  step  m,  pretending  that  it  has  received  fj(x Rl ,  Rff )  in  step  l  of  P. 

-  If  Tm  ±  Rh  f!n(xTm,T+)  =  fm(xTm,T+). 

•  Ti  first  computes  EQPl  in  the  same  way  as  in  P.  Then  Ti  sets  EQPl  =  1  if  fj  (xRl .  R.f  )  / 
fi(xTl,T+),  else  no  change. 

Proof:  There  are  two  cases: 

•  fi(xRn  Rf)  =  fl  (xTi ,  Tj+):  It  is  not  hard  to  see  that  in  this  case,  the  execution  of  every  step  is 
identical  in  both  P  and  P' ,  except  for  step  l.  So  for  all  i  T],  EQi  is  identical  in  both  protocols. 
Since  //( xRl,  R ;+)  =  fi(xRl,  T)+),  EQPl  remains  unchanged,  so  it  is  also  identical  in  both  protocols. 

•  fj  (xRl ,  /?7+ )  /  fi(xTt,  T+)'.  Observe  that  these  two  functions  are  different  only  if  the  inputs  are 
not  all  identical.  So  it  is  correct  to  set  EQPl  =  1. 

□ 

Let  us  denote  all  the  symbols  a  node  i  receives  from  and  sends  to  the  other  nodes  throughout  the 
execution  of  protocol  P  as  i+  and  i~ ,  respectively.  It  is  obvious  that  i~  can  be  written  as  a  function 
Fi(xi,i+),  which  is  the  union  of  fi(xi,i+(l)).  If  a  protocol  P  satisfies  Fi{xi,i+ )  =  Fi(xi)  for  all  i,  we 
say  P  is  individual- input-determined  (iid).  The  following  lemma  shows  that  there  is  always  an  iid 
equivalent  for  every  protocol. 
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Lemma  2  For  every  protocol  P ,  there  always  exists  an  iid  equivalent  protocol  P* ,  which  corresponds 
to  a  partially  ordered  acyclic  graph. 

Proof:  According  to  Lemma  1,  we  can  flip  the  direction  of  any  edge  in  G  and  obtain  a  new  protocol 
which  is  equivalent  to  P.  It  is  to  be  noted  that  we  can  keep  flipping  different  edges  in  the  graph,  which 
implies  that  we  can  flip  any  subset  of  E  and  obtain  a  new  protocol  equivalent  to  P. 

In  particular,  we  consider  a  protocol  equivalent  to  P,  whose  corresponding  graph  is  acyclic,  i  <  j 
for  all  (i,j)  €  E,  with  a  proper  re-labeling  of  the  indexes.  In  this  protocol,  every  node  i  has  no  incoming 
links  from  any  node  with  index  greater  than  i.  This  implies  that  the  symbols  transmitted  by  node 
i  are  independent  of  the  nodes  with  larger  indexes.  Thus  we  can  re-order  the  transmissions  of  this 
protocol  such  that  node  1  transmits  on  all  of  its  out-going  links  first,  then  node  2  transmits  on  all  of 
its  out-going  links,  ...,  node  n  —  1  transmits  to  n  at  the  end.  Name  the  new  protocol  Q.  Obviously  Q 
is  equivalent  to  P. 

Since  we  can  always  find  a  protocol  Q  equivalent  to  P  as  described  above,  all  we  need  to  do  now 
is  to  find  P* .  If  Q  itself  is  iid,  then  P*  =  Q  and  we  are  done.  If  not,  we  obtain  P*  in  the  following  way 
(using  function  F'),  which  is  similar  to  how  we  obtain  the  equivalent  protocol  P'  in  Lemma  1: 

•  For  node  1,  since  it  receives  nothing  from  the  other  nodes,  F\  (x\ .  1+)  =  F\{x\)  is  trivially  true. 

•  For  node  1  <  i  <  n,  we  modify  Q  as  follows:  node  i  computes  its  out-going  symbols  as  a  function 
F'(xi)  =  Fi(xi,i+\Xl=...=Xn=Xi),  where  i+  \Xl=...=Xn=x.  is  the  expected  incoming  symbols  to  node  i 
assuming  all  parties  have  the  same  input  as  i.  At  the  end,  node  i  checks  if  i+  \Xl=...=Xn=Xi  equals 
to  the  actual  incoming  symbols  i+ .  If  they  match,  nothing  is  changed.  If  they  do  not  match,  the 
inputs  can  not  be  identical,  and  node  i  can  set  EQ,  =  1. 


□ 

Lemma  2  shows  that,  to  study  Cad(?T  M),  it  is  sufficient  to  investigate  only  protocols  that  are  iid 
and  partially  ordered. 


7  MEQ-AD(3,6) 


Let  us  first  consider  MEQ-AD(3,6),  i.e.,  the  case  where  3  nodes  (say  A,  B  and  C)  are  trying  to  solve  the 
MEQ-AD  problem  when  each  node  is  assigned  one  out  of  six  values,  namely  {1,  2,  3, 4,  5,  6}.  According 
to  Lemmas  1  and  2,  for  any  protocol  that  solves  the  MEQ-AD  problem,  there  exists  an  equivalent 
iid  partially  ordered  protocol  in  which  node  A  has  no  incoming  link,  node  B  only  transmits  to  node 
C,  and  node  C  has  no  out-going  link.  We  construct  one  such  protocol  that  solves  MEQ-AD(3,6)  and 
requires  only  3  channel  symbols,  namely  {1,  2, 3},  per  link.  The  channel  symbol  being  sent  over  link  ij 
is  denoted  as  stj .  Table  1  shows  how  Sij  is  computed  as  a  function  of  x*. 


X 

1 

2 

3 

4 

5 

6 

SAB 

1 

1 

2 

2 

3 

3 

SAC 

1 

2 

2 

3 

3 

1 

SBC 

1 

2 

3 

1 

2 

3 

Table  1:  A  protocol  for  MEQ-AD(3,6) 


For  nodes  B  and  C,  they  just  compare  the  channel  symbol  received  from  each  incoming  link  with 
the  expected  symbol  computed  with  its  own  input  value,  and  detect  a  mismatch  if  the  received  and 
expected  symbols  are  not  identical.  For  example,  node  B  receives  sab(xa)  from  node  A.  Then  it  detects 
a  mismatch  if  the  received  sab(xa )  /  sab{xb)- 


5 


It  can  be  easily  verified  that  if  the  three  input  values  are  not  all  identical,  at  least  one  of  nodes  B 
and  C  will  detect  a  mismatch.  Hence  the  MEQ-AD(3,6)  problem  is  solved  with  the  proposed  protocol. 
The  communication  complexity  of  this  protocol  is  31og23  =  log2  27.  In  fact,  this  protocol  is  optimal 
in  the  sense  that  it  achieves  the  communication  complexity  of  MEQ-AD(3,6): 

Cad(3,  6)  =  log2  27.  (9) 

The  proof  of  optimality  can  be  found  in  Appendix  A. 

8  MEQ-AD(3,6ft) 

Now  consider  the  MEQ-AD(3,6/l)  problem.  One  way  to  solve  this  problem  is  to  extend  the  Table  1  as 
Table  2.  This  protocol’s  communication  complexity  is  21og2(6^/2)  +  log2  3  =  log2(36/t~1  x  27). 


X 

1 

2 

3 

4 

5 

6 

7 

8 

6ft-l 

Qh 

SAB 

1 

1 

2 

2 

3 

3 

4 

4 

672 

672 

$Ac 

1 

2 

2 

3 

3 

4 

4 

5 

672 

1 

SBc 

1 

2 

3 

1 

2 

3 

1 

2 

2 

3 

Table  2:  A  protocol  for  MEQ-AD(3,6fe) 


A  more  efficient  way  to  solve  this  problem  is  to  map  each  of  the  6h  input  values  into  a  /j- dimensional 
vector  in  the  vector  space  {1,  2,  3, 4,  5,  6}h,  and  then  solve  the  MEQ-AD(3,6)  problem  h  times,  one  for 
each  of  the  h  dimensions.  Using  the  optimal  MEQ-AD(3,6)  protocol  introduced  in  Section  7,  the 
communication  complexity  of  this  protocol  is  /?.log2  27  =  log2  27h  <  log2(36ft~1  x  27). 


9  MEQ-AD(3,2fc) 

Now  we  construct  a  protocol  when  the  number  of  possible  input  values  M  =  2fc,  k  >  1  and  only  binary 
symbols  can  be  transmitted  in  each  step,  using  the  MEQ-AD(3,6)  protocol  we  just  introduced  in  the 
previous  sections  as  a  building  block. 

First,  we  map  the  2k  input  values  into  2k  different  vectors  in  the  vector  space  {1,  2, 3, 4,  5, 6}ft, 
where  h  =  [~ k  log6  2] .  Then  h  instances  of  the  MEQ-AD(3,6)  protocol  are  performed  in  parallel  to 
compare  the  h  dimensions  of  the  vector.  Since  3  channels  symbols  are  required  for  each  instance  of 
the  MEQ-AD(3,6)  protocol,  we  need  to  transmit  a  vector  from  {1,2,  3}^  on  each  of  the  links  AB,  AC 
and  BC.  One  way  to  do  so  is  to  encode  the  3h  possible  vectors  from  {1,2, 3}^  into  b  =  [ h  log2  3]  bits, 
and  transmit  the  b  bits  through  the  links.  Since  the  h  instances  of  MEQ-AD(3,6)  protocols  solve  the 
MEQ-AD(3,6)  problem  for  each  dimension,  altogether  they  solve  the  MEQ-AD(3,2fc)  problem. 

The  communication  complexity  of  the  proposed  MEQ-AD(3,2fc)  protocol  can  be  easily  computed 
as 


= 

3  r /*■  log2  3] 

(10) 

= 

3  f  7  log6  2]  log2  3] 

(11) 

< 

3  T  (k  log6  2  +  1)  log2  3] 

(12) 

< 

3  [(k  log6  2  +  1)  log2  3  +  1] 

(13) 

= 

3 k  logg  3  +  3(log2  3  +  1) 

(14) 

< 

1.8401b  +  7.755 

(15) 
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Figure  1:  Complexity  of  the  proposed  protocol  v.s.  upper  bound  2k 


From  Eq.15,  we  can  see  that  when  k  is  large  enough,  the  communication  complexity  this  pro¬ 
tocol  becomes  smaller  than  the  upper  bound  21og2M  =  2k  from  Section  4.  In  Fig.l,  we  plot  the 
communication  complexity  of  this  protocol,  according  to  Eq.ll.  It  is  easy  to  see  that  for  k  >  39, 
C{P)  <  2k. 

The  way  in  which  the  above  protocol  is  constructed  can  be  generalized  to  obtain  a  MEQ-AD(3,M) 
protocol  P  with  complexity 

C(P)  <  1.840  log2M  +  A  (16) 

for  arbitrary  value  of  M,  where  A  is  some  positive  constant. 

10  About  MEQ-CD 

In  this  section,  we  will  show  that  CcD{n,  M )  roughly  equals  to  Cad(it-,  M ): 

Cad{ji,  M)  <  CcD{n ,  M)  <  Cad{ji ,  M)  +n  —  2.  (17) 

We  have  shown  the  lower  bound  in  Section  3.  We  will  now  prove  the  upper  bound. 

Consider  any  partially  ordered  iid  protocol  P  E  T  AD{n,  M)  as  described  in  Lemma  2.  We  construct 
a  protocol  P'  by  having  node  i  to  send  EQi  to  node  n  by  the  end  of  P,  for  all  1  <  i  <  n.  Node  n 
collects  n  —  1  decisions  (including  EQn  as  computed  in  P)  from  all  nodes  except  for  node  1.  Then 
node  n  compute  the  final  decision 


EQ'n  =  umx{EQ2,- ■  ■  ,EQn}-  (18) 

It  is  easy  to  see  that,  EQ'n  =  EQ(xi,  ■  ■  ■ ,  xn).  So  P'  E  Yco{n,M).  Since  C(P')  =  C(P )  +n  —  2, 
the  upper  bound  is  proved.  From  Eq.16  it  then  follows  that  there  exist  a  protocol  P'  that  solves 
MEQ-CD(3,M)  with  complexity  C(P')  <  21og2  M,  for  large  enough  M. 
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11  Conclusion 


In  this  report,  we  study  the  communication  complexity  problem  of  the  multiparty  equality  function, 
under  the  point-to-point  communication  model.  We  demonstrate  that  traditional  techniques  general¬ 
ized  from  two-party  communication  complexity  problem  are  not  sufficient  to  obtain  tight  bounds  under 
the  point-to-point  communication  model.  We  then  introduce  techniques  to  transform  any  MEQ-AD 
protocol  into  a  equivalent  partially  ordered  iid  protocol.  These  techniques  significantly  reduce  the  space 
of  MEQ-AD  protocols  to  study.  We  then  study  the  MEQ-AD (3,6)  problem  and  introduce  an  optimal 
protocol  that  achieves  Cad( 3,6).  This  protocol  is  then  used  as  building  blocks  for  construction  of 
efficient  protocols  for  MEQ-AD(3,6ft)  and  MEQ-AD(3,2fc).  The  problem  of  finding  the  communication 
complexity  of  the  MEQ  problem  for  general  values  of  n  and  M  is  still  open. 
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A  Edge  Coloring  Representation  of  MEQ-AD (3, M) 

From  Sections  6  and  7,  we  have  shown  that  it  is  sufficient  to  study  3-node  systems  where  information 

is  transmitted  only  on  links  AB,  AC  and  BC.  Let  us  denote  |sab|,  I  Me  I  and  |sbc|  as  the  number  of 

different  symbols  being  transmitted  on  links  AB,  AC  and  BC,  respectively.  Now  consider  the  following 
simple  bipartite  graph  G(U.  V,  E),  where  U  and  V  are  the  two  disjoint  sets  of  vertices  and  E  is  the  set 
of  edges: 

•  \U\  =  | sab |,  each  vertex  is  labeled  as  USAB rx\  for  all  M  values  of  x; 

•  |Vj  =  |sac|)  each  vertex  is  labeled  as  VSACtx)  for  all  M  values  of  x; 

•  etj  =  ( Ui ,  Vj)  G  E  if  and  only  if  i  =  sab(x )  and  j  =  sac(x)  f°r  some  x. 

In  essence,  each  vertex  Ui  (or  V))  represents  the  set  of  value  x’s  that  produce  the  same  value  sab(x)  =  i 
(or  sac{x )  =  *);  and  each  edge  =  (Ui,  Vj)  represents  the  set  of  value  x’s  that  produces  the  same  pair 
of  channel  symbols  sab(x)  =  i  and  sac(x)  =  j.  Let  | 1  be  the  size  set  of  value  x’s  corresponding  to 
edge  eij.  Fig.2  shows  the  bipartite  graph  corresponding  to  the  MEQ-AD(3,6)  protocol  we  introduced  in 
Section  7.  Near  the  nodes  U  and  V)  we  show  the  set  of  value  x’s  such  that  sab{x)  =  i  and  sac(x )  =  i, 
respectively.  The  number  near  each  edges  is  the  corresponding  value  of  that  edge. 

We  first  argue  that 

Lemma  3  | e^j  =  1  for  all  G  E.  Hence  \E\  =  M  and  \U\  x  \V\  >  M. 

Proof:  Suppose  to  the  contrary  that  there  exists  some  G  E  with  \eij\  >  2.  Then  there  must  be 
two  values  x,x'  such  that  x  /  x' ,  sab(x)  =  sab(x')  and  sac(x )  =  sac(x').  Similar  to  the  “fooling 
set”  argument  in  Section  5,  it  is  impossible  for  nodes  B  and  C  to  tell  the  difference  between  the  two 
input  vectors  (x,  x,x)  and  (x',x,x),  hence  they  will  not  be  able  to  solve  the  MEQ-AD(3,M)  problem, 
which  leads  to  a  contradiction.  Then  the  first  part  of  the  lemma  follows. 


1,6 

2,3 

4,5 


Figure  2:  The  bipartite  graph  corresponding  to  the  MEQ-AD(3,6)  protocol  from  Section  7. 

Since  every  edge  represents  one  x,  and  there  are  M  possible  values  of  x,  it  follows  that  \E\  =  M. 
Also,  in  a  simple  bipartite  graph,  we  always  have  \U\  x  |Vj  >  \E\.  Thus  \U\  x  \V\  >  M. 

□ 

Since  there  is  a  one-to-one  mapping  from  the  set  of  input  values  {1 ,  •  •  • ,  M}  to  the  edges  E,  we 
will  use  the  terms  input  value  (x)  and  edge  (e^)  interchangeably.  Now  we  prove  the  following  theorem 
on  the  constraint  of  sbc'- 

Lemma  4  sbc(x)  /  sbc(x')  if  edges  x  and  x'  are  adjacent  or  there  is  some  other  edge  that  is  adjacent 
to  both  of  them,  in  the  bipartite  graph  G(U ,  V,  E). 

Proof:  Consider  any  pairs  x,  x'  such  that  x  /  x'  and  ( USABrx ),  V8ac(x'))  ^  Let  x*  (maybe  equal  to 
x  or  x')  be  the  input  value  that  edge  ( USAB m,  VSAC(X'))  corresponds  to.  So  we  have  sab(x*)  =  sab(x) 
and  sac(x*)  =  sac{x').  Now  consider  the  input  vector  (x*,x,x')  at  node  A,  B  and  C.  Since  sab{x*)  = 
sab(x),  node  B  can  not  differentiate  (x*,x,xf)  from  (x,x,  x).  So  node  B  cannot  detect  the  mismatch. 
Meanwhile,  since  sac(^*)  =  sac(,x'),  node  C  can  not  differentiate  (x*,x,  x')  from  (x',x',x')  by  just 
looking  into  the  receives  sac ■  So  sbc(x )  must  be  different  from  sbc{x '),  otherwise  the  MEQ-AD 
problem  is  not  solved.  Then  the  lemma  follows.  □ 

Now  we  can  conclude  that  the  problem  of  designing  sbc ,  given  functions  sab(-)  and  sac(0> 
is  equivalent  to  finding  a  distance- 2  edge  coloring  for  the  corresponding  bipartite  graph  G(U,V,  E). 
Furthermore,  it  should  not  be  hard  to  see  that  any  protocol  P  that  solves  MEQ-AD(3,M)  is  equivalent 
to  a  bipartite  graph  G(U,V,E)  together  with  a  distance-2  coloring  scheme  W  such  that  \U\  =  |sab|, 
\V\  =  |  sac  I  >  1-^1  =  Af,  and  \W\  =  IsbcI)  where  \W\  denotes  the  number  of  colors  in  scheme  W.  Notice 
that 

C(P)  =  log2  +  log2  |  sac  I  +  log2  |  sbc  I  =  log2  ( |  sab  |  x  |sac|  x  |sbc|)-  (19) 

So  we  have  the  following  theorem: 

Theorem  1  The  existence  of  a  MEQ-AD (3, M )  protocol  P  with  complexity  C(P)  is  equivalent  to  the 
existence  of  a  simple  bipartite  graph  G(U,  V,  E)  together  with  a  distance-2  coloring  scheme  W  such  that 
\U\  x  \V\  x  \W\  =  2C(P\  given  \E\  =  M,  \U\  x  \V\  >  M,  \U\  x  \W\  >  M  and \V\  x  \W\  >  M. 

Proof:  The  last  two  conditions  come  from  the  fact  that  we  can  flip  the  directions  of  the  links  and 
then  apply  Lemma  3.  □ 

According  to  Theorem  1,  we  can  conclude  that  the  problem  of  finding  Cad{2> ,  M)  is  equivalent  to 
the  problem  of  finding  the  minimum  of  \U\  x  \  V\  x  \  W\  for  the  bipartite  graphs  and  distance-2  coloring 
schemes  that  satisfy  the  above  constrains. 

Using  Theorem  1,  to  show  that  Cad{3 , 4)  =  4,  we  only  need  to  show  that  for  every  combination  of 
\U\  x  |Vj  x  \W\  <  24  =  16  there  exists  no  bipartite  graph  G(U,  V,  E)  and  distance-2  coloring  scheme  W 
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that  satisfy  the  conditions  as  described  in  Theorem  1.  In  other  words,  if  the  conditions  are  all  satisfied, 
then  the  bipartite  graph  G(U,  V,  E )  cannot  be  distance-2  colored  with  \W\  colors.  It  is  not  hard  to  see 
that  there  are  only  two  combinations  (up  to  permutation)  that  satisfy  all  conditions  and  have  product 
less  than  16:  (2,  2,  2)  and  (2,  2,  3).  Notice  that  in  both  cases,  | E\  =  \U\  x  |V|,  where  every  pair  of  edges 
are  within  distance  of  2  of  each  other,  which  means  graph  G  can  only  be  distance-2  colored  with  at  least 
\E\  colors.  Together  with  the  upper  bound  from  Section  4,  this  proves  that  Cad(3,  4)  =  Ccd{ 3, 4)  =  4. 

Similarly,  it  can  be  shown  that  Cad(3,  6)  =  log2  27.  There  are  only  two  combinations  that  satisfy 
all  conditions  in  Theorem  1  and  have  product  less  than  27:  (2,  3,  3)  and  (2,  3,  4).  Again,  \E\  =  \U\  x  |V|, 
so  at  least  \E\  =  6  colors  are  needed,  which  proves  that  Cad( 3,  6)  =  log2  27. 
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